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Abstract 

As  measurement  uncertainties  are  closely  tied  up  with  error  models,  it  might  be  of  in- 
terest to  review  a model,  which  the  author  cissigns  to  “Metrological  Statistics”.  Given 
that  the  random  errors  are  normally  distributed,  the  experimentalist  could  either  refer  to 
B.L.  Welch’s  concept  of  “effective  degrees  of  freedom”  or  to  the  multidimensional  Fisher- 
Wishart  distribution  density.  In  the  first  case,  different  numbers  of  repeated  measure- 
ments are  admissible,  in  the  latter  it  is  strictly  required  to  have  equal  numbers  of  repeated 
measurements.  In  error  propagation,  however,  only  the  latter  mode  of  action  opens  up 
the  possibility  of  designing  confidence  intervals  according  to  Student  and  confidence  el- 
lipsoids according  to  Hotelling.  Another  point  of  view,  closely  linked  to  the  choice  of  the 
numbers  of  repeated  measurements,  refers  to  the  customary  practice  of  attributing  equal 
rights  to  statistical  expectations  and  empirical  estimators.  However,  the  Fisher-Wishart 
distribution  density  suggests  using  only  the  information  which  is  realistically  accessible  to 
experimentalists,  namely  empirical  estimators.  For  the  handling  of  unknown  systematic 
errors,  either  the  existence  of  a (rectangular)  distribution  density  may  be  assumed  or, 
and  this  is  proposed  here,  they  may  be  classified  as  time-constant  quantities,  biasing  ex- 
pectations and  suspending  a lot  of  tools  and  procedures  of  error  calculus  well-established 
otherwise. 


1 Introduction 

The  joint  propagation  of  random  errors  and  unknown  systematic  errors  currently  places 
the  experimentalist  in  the  following  dilemma. 

In  regard  to  the  propagation  of  random  errors,  there  are,  at  least  in  principle,  two 
different  choices.  If  one  is  willing  to  accept  unequal  numbers  of  repeated  measurements 
of  the  physical  quantities  to  be  combined  within  a given  function,  one  has,  in  order  to 
express  the  influence  of  random  errors,  to  resort  to  B.  L.  Welch’s  sophisticated  concept 
of  so-called  numbers  of  effective  degrees  of  freedom  [8].  However,  this  procedure  is  tied 
up  with  difficulties:  it  is  restricted  to  independent  variables. 

Though  B.  L.  Welch’s  concept  completely  exhausts  the  information  implied  in  meas- 
ured data,  unfortunately,  from  a metrological  point  of  view,  it  is  cumbersome  to  handle 
and  obstructs  the  view  to  existing  simpler  procedures.  On  the  other  hand,  if  the  ex- 
perimentalist preferred  equal  numbers  of  repeated  measurements,  he  would  — if  need 
be  — have  to  give  away  part  of  his  information,  namely  that  which  is  carried  by  the 
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excessive  numbers  of  repeated  measurements  of  the  variables  involved.  Up  to  how,  the 
disregarding  of  excessive  numbers  is  regarded  as  unfavourable.  In  spite  of  this  view, 
just  this  precaution  opens  up  a toolbox  of  applied  statistics  hitherto  closed  to  metrolo- 
gists,  as  only  with  equal  numbers  of  repeated  measurements,  is  the  experimentalist  in 
a position  to  call  upon  the  standard  model  of  statistics  for  jointly  normally  distributed 
random  variables,  i.e.  the  Fisher-Wishart  density  [3].  The  advantages  gained  in  that 
way  outweigh  by  far  the  “lost  information” , as  relatively  few  repeated  measurements  of 
experimental  set-ups,  operating  in  a stationary  mode,  are  able  to  locate  accurately  the 
respective  physical  quantities.  After  all,  in  error  propagation  the  experimentalist  may 
define  confidence  intervals  according  to  Student  (Gosset)  including  any  number  of  vari- 
ables. In  least  squares,  he  may  even  establish  multidimensional  confidence  intervals,  and 
last  but  not  least,  certain  problems  of  classical  error  calculus,  such  as  the  Fisher-Behrens 
problems  no  longer  arise. 

In  regard  to  the  interpretation  and  propagation  of  unknown  systematic  errors,  the 
situation  is  not  simpler.  Let  us  assume  that  an  unknown  systematic  error  /,  constant  in 
time,  is  confined  to  an  interval  of  the  kind^ 

-fs<f<fs,  /.  >0.  (1.1) 

Now,  the  experimentalist  may  either  assign  a postulated  probabilty  density  to  /,  usually 
a rectangular  density  [7], 

= (1-2) 

or  he  may  set  without  exception 

/ = constant,  (1.3) 

where  / lies  anywhere  within  (1.1).  The  latter  interpretation  introduces  biased  estim- 

ators, leading  to  a break-down  of  many  procedures  of  efror  calculus  otherwise  well- 
established. 

Seen  mathematically,  both  interpretations  should  be  justified.  In  the  case  of  (1.2), 
the  combination  of  random  and  systematic  errors  should  be  carried  out  geometrically, 
in  the  case  of  (1.3),  arithmetically.  Regarding  (1.3),  the  author  suggests  adding  lin- 
early Student’s  confidence  intervals  to  appropriately  designed  worst-case  estimates  of 
the  propagated  systematic  errors,  and  no  probability  statements  should  be  associated 
with  so-defined  overall  uncertainties. 

2 Error  propagation 

The  fundamental  error  equations  of  Metrological  Statistics  are  given  as  follows  [4].  Let 
xo  designate  the  true  value  of  the  physical  quantity  x to  be  measured.  Furthermore,  let 
£;  be  the  random  error  and  fx  = constant  the  unknown  systematic  error  corresponding 


^Should  the  interval  be  unsymmetrical  to  zero,  it  could  be  symmetrized  by  subtracting  the  halved  sum  of  the 
Upper  and  lower  boundary  — the  same  quantity  would  have  to  be  subtracted  from  the  data. 
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to  (1.1).  We  then  have 


Xi  = Xq  + El fx,  / = l,...,n. 


Let  Hx  = xo  + fx  be  the  expectation  of  the  random  variable  X = {xi,X2,. ..  ,x„},  so 
that  the  Xi  are  some  of  its  realizations.  We  then  find 

Xi  = fix  + et,  I = 1,...  ,n.  (2.2) 

Furthermore,  let  x — l/n^"^j  xi  denote  the  arithmetic  mean.  We  then  have  the  useful 
identities 

Xi  Xq  "h  (^X[  /^x)  “h  fxt  X — Xq  "f“  (^X  /Tx)  “h  fx'  (^*^) 

While  the  arithmetic  mean  is  biased,  the  empirical  variance 


is  not.  For  the  time  being,  let  us  consider  just  two  quantities  to  be  measured,  x and  y. 
As  robust  and  simple  uncertainty  assessments  are  a matter  of  linearization,  the  overall 
uncertainty  of  a given  function  (j>{x,y)  is  proposed  to  be  [5], 


S +(^\  S2  + ^ f + ^ f 

\dy)  dx  ^ dy 


where  ts,p  (n  - 1)  is  the  Student-factor  corresponding  to  a confidence  level  P.  We  dis- 
tinctly see  how  the  empirical  covariance 


1 ^ ^ 

rrr  ~ ~ 


enters  the  empirical  variance  of  the  4>{xi,yi)-,  I = 1, . . . , n,  given  by 

4=5^|;i'^(»'..»)-^(j,5)f=(g)  »^+2(|)(|)..,+(g)  4 

, The  final  result 

(j>{x,y)±U4,  (2.6) 

is  expected  to  localize  the  true  value  0 (xo,  j/o)  with  “reasonable  certainty”  — but  no 
proper  confidence  statement  should  be  added,  as  is  a mixture  of  a statistical  and 
a non  statistical  component.  The  last  term  in  (2.5)  may  overestimate  the  uncertainty, 
on  the  other  hand  linearization  errors  have  been  negleted.  After  all,  this  uncertainty 
statement  should  fulfill  the  prerequiste  to  be  safe,  robust  and  simple. 

If  there  are  m quantities  to  be  measured,  we  replace  the  notation  x,  i/  by  xi , X2, . . . , x^. 
Then  the  overall  uncertainty  of  the  final  result 

0(Xi,X2,...,  Xjn  ) ± 
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is  given  by 


When  (2.5)  and  (2.7)  are  compared,  it  becomes  obvious  that  the  proposed  formalism 
of  error  propagation  works  like  a building  kit,  perspicuous  and  easy  to  handle.  There  are 
arguments  against  (2.7),  in  particular  that  an  experimentalist  who  wishes  to  design  his 
uncertainties  in  this  way,  would  have  to  know  the  complete  set  of  repeated  measurements, 
in  other  words,  the  complete  empirical  variance-covariance  matrix 

s = (sij') , i,j  1, 2, ...  , m,  (2-^) 

of  the  input  data.  Arguably,  this  is  true,  but  in  the  days  of  computers  and  the  internet 
such  a challenge  should  no  longer  be  apt  to  provoke  difficulties  worth  mentioning.  An- 
other argument,  that  (2.7)  might  overestimate  overall  uncertainties,  should  be  judged  in 
view  of  the  unique  role  of  metrology  in  science.  Standing  “between”  theory  and  experi- 
ment, metrology  pursues  the  idea  to  localize  reliably  the  value  of  the  physical  quantity 
in  question. 

3 Least  squares 

Let 

A/3^x  (3.1) 

be  a linear  system  of  equations  to  be  adjusted.  Here,  A designates  the  m x r design 
matrix  of  rank  r,  /3  the  r x 1 vector  of  unknowns  and,  finally,  x the  m x 1 vector  of 
the  observations  or  input  data.  We  assume  tn>  r.  The  idea  of  least  squares  is  of  purely 
geometrical  origin. 

In  what  follows,  A^  denotes  the  transpose  of  A.  The  idea  is  to  project  the  vector  x 
by  means  of  a projection  operator 

P = A(A^A)-^A^  (3.2) 

orthogonally  onto  the  column  space  of  the  matrix  A,  and  the  result  is 

{A'^Ay^A'^x.  (3.3) 

As  the  solution  vector  P is  linear  in  the  input  data,  the  transfer  of  (2.7)  to  its  components 
Pk,  h = 1, . . . ,r,  is  straightforward. 

Clearly,  the  orthogonal  projection  is  in  no  way  dependent  on  the  error  model  implied. 
In  contrast  to  this,  the  latter  turns  out  to  be  crucial  in  regard  to  uncertainty  assessments. 
Let  us  consider  a set  of  single  observations 

Xi=  xo^i  +Bi  + fi=  xo^i  A{xi-pii)  + fi,  i = l,...  ,m,  (3,4) 

being  the  input  data,  where  E {A,}  = pi.  Writing  (3.4)  in  vector  form,  we  have 

X = Xo  + {x-  p)  + f (3.5) 
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, where 

X ~ {X\,  X2)  • ■ • ) Xfn)  ) Xo  — (xQjI,  Xq^2^  • * • j ^0,m)  ? 

M ~ (M1  1 A*2)  ■ • • ) /^m)  1 / ~ (/l ) /2)  • • • ) /to)  ) /s,i  ^ ft  ^ /s,i  • 

Given  equal  variances  = E ^{Xi  — the  minimized  sum  Qmin  of  squared 

residuals  of  the  adjusted  system  (3.1)  should  yield,  according  to  quite  familiar  procedures, 
an  estimator  ss  cr^.  However,  from 

Qmin  — i^X  Hx)  (x  Hx)  , 

we  obtain  something  different,  namely 

E {Qmin}  = (7^  (m  - r)  + /^/  - f'^Pf.  (3.6) 

As  we  see,  even  the  simplest  of  all  associated  least  squares  procedures  breaks  down, 
should  the  model  of  time-constant  unknown  systematic  errors  be  accepted.  At  the  same 
time  the  related  basic  tool  linked  to  Qmin  and  frequently  used,  namely  the  test  of  con- 
sistency of  the  input  data  based  on  the  criterion 

Qmin/a  ^ m T 

breaks  down  as  well.  Indeed,  during  many  decades,  time  and  again,  the  observation 

Qmm/s^  :^m-r 

has  stunned  experimentalists  [2],  so  that,  in  the  adjustments  of  the  fundamental  physical 
constants,  even  the  abolition  of  least  squares  has  been  considered  [1].  However,  in  view 
of  (3.6),  these  observations  are  understandable. 

After  all,  a least  squares  adjustment  of  biased  input  data  requires  arithmetic  means 

Xi  = xo^i  + {xi-  fi,  i = l,...,m,  (3.7) 

so  that  the  empirical  variances  and  covariances 

1 " 

Sij  ~ r ^ ^ (,Xil  Xj)  {Xji  Xj),  Sii  — Sj  , (^’6) 

~ i=l 

are  known  a priori.  Replacing  (3.5)  by 

X = xo  -t-  (x  - /x)  -f-  / (3.9) 

. instead  of  (3.3),  we  find 

^={A'^Ay^A'^x.  (3.10) 

A matter  of  similar  concern  refers  to  the  break-down  of  the  Gauss-Markoff  theorem.  In 
view  of  (3.9),  the  solution  vector  P is  biased,  so  that  the  experimentalist  is  no  longer  in 
a position  to  obtain  a weight-matrix  from  the  variance-covariance  matrix  of  the  input 
vector  X.  Consequently,  simple,  optimized  adjustments,  to  which  we  are  customarily 
used,  must  be  ruled  out.  Nevertheless,  we  may  multiply  (3.1)  from  the  left  with  any 
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non-singular  weighting  matrix,  e.g.  with  a diagonal  one, 

G {gi,g2,--‘  ,9m},  9i  = —,  (3.11) 

Uxi 

and  adjust  the  weights  gi  by  trial  and  error  in  order  to  find  the  shortest  possible  uncer- 
tainty intervals.  As  has  been  shown,  this  method  is  also  able  to  detect  inconsistencies 
among  the  input  data,  [6] . Indeed,  as  a non-singular  weight-matrix  cannot  shift  the  true 
solution  vector  j3o,  we  are  allowed  to  proceed  this  way. 

To  assign  uncertainties  to  the  components  ^k',k  = 1,. . . ,r  of  the  solution  vector  P, 
we  refer  to  (2.7).  To  abbreviate  the  notation,  we  set  in  (3.10) 

B = A{A^Ay^  (3.12) 

where  the  elements  of  the  matrix  B will  be  designated  by  bih-  Upon  insertion  of  (3.9) 
into  (3.10),  we  arrive  at 

B = B^xo  + ($  - /i)  + (3.13) 

Evidently,  /3q  = B'^xq  is  the  true  value  of  the  estimator  Setting  = E {$}  = 
Po  4-  B'^f,  we  may  define  the  theoretical  variance-covariance  matrix 

which,  however,  remains  numerically  inaccessible.  Consequently,  the  only  thing  we  can 
do  is  to  resort  to  the  empirical  variance-covariance  matrix 

S0  = = B'^sB,  k = l,2,...,r,  (3.14) 

whose  elements  are  given  by 

m 

(3.15) 

ij 

Clearly,  the  Sij  are  the  elements  of  the  empirical  variance-covariance  matrix  s of  the 
input  data,  as  has  been  stated  in  (2.8)  and  (3.8). 

These  procedures  presuppose,  as  has  been  pointed  out,  equal  numbers  of  repeated 
measurements  within  each  of  the  m means  (3.7).  The  components  Pk  of  the  solution 
vector  may  be  written  as 


^ n m 

0k  = -Y'Pki  with  hi  = y^bikXii-,  k = l,..., 

n “ 


r. 


(3.16) 


1=1 


i=l 


Evidently,  the  l3ki  are  independent  and  normally  distributed.  Let  denote  the 
expectations 


of  the  h-  Looking  for  just  any  one  of  the  ft, 

5 ts,p  {iT'  — i)  ^ ^ a , (^^  - 1) 

h ^ ft  -I  ^ 


(3.17) 


(3.18) 
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is  a confidence  interval  according  to  Student,  where  ts,p{n-  1)  is  the  Student-factor. 
This  interval  localizes  with  confidence  P. 

The  components  of  the  third  term  on  the  right-hand  side  of  (3.13)  are  given  by 

m 

fp,=Y.^ikfu  k = l,...,r.  (3.19) 

i=l 

Worst-case  estimates  are 

m 

k = l,...,r.  (3.20) 

t=i 

After  all,  the  overall  uncertainties  of  the  components  of  the  solution  vector  con- 
sidered and  employed  individually,  are  proposed  to  be 

“A  = + fsA  - k=l,...,r.  (3.21) 

4 Uncertainty  spaces 

The  component  representation  of  (3.13), 

m m 

0k  = 00, k + ^ bik  {Xi  - fii)  + ^ b^kfi  (4.1) 

»=1  i=l 

reveals  the  couplings  between  the  least  squares  estimators.  Those  due  to  random  errors 
may  be  expressed  by  means  of  Hotelling’s  density  [3].  The  last  term  on  the  right-hand 
side  of  (4.1), 

m 

Uk  = k=\,...  (4.2) 

i=l 

expresses  the  couplings  due  to  systematic  errors.  The  r components  map  the  m- 
dimensional  hypercuboid 

-fs,i<fi<fs,u  i = (4.3) 

onto  the  r-dimensional  space,  yielding  a convex  polytope.  Both  solids  may  be  combined 
to  an  overall  uncertainty  space,  resembling  a “convex  potato”.  Figures  1-3  show  the 
confidence  ellipsoid,  the  “security  polytope”  and  the  combination  of  both  to  an  overall 
uncertainty  space  for  the  example  of  a least  squares  adjustment  of  a circle. 

5 Conclusion 

As  computer  simulations  reveal,  the  approach  presented  here  leads  to  measurement  un- 
certainties safeguarding  physical  objectivity  in  the  sense  that  uncertainty  intervals  re- 
liably locate  the  values  of  the  physical  quantities  in  question.  With  such  a distinct 
statement,  the  traceability  of  units  and  standards  will  certainly  be  maintained. 
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Fig.  1.  Confidence  ellipsoid,  security  polytope,  overall  uncertainty  space  resembling  a 
“convex  potato”. 
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